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Abstract 

The non-long terminal repeat (LTR) retrotransposon /, which belongs to the /superfannily of non-LTR retrotransposons, is well known 
in Drosophila because it transposes at a high frequency in the female germline cells in l-R hybrid dysgenic crosses of Drosophila 
melanogaster Here, we report the occurrence and the upregulation of an /-like element in the hybrids of two sister species belonging 
to the repleta group of the genus Drosophila, D. mojavensis, and D. arizonae. These two species display variable degrees of pre- and 
postzygotic isolation, depending on the geographic origin of the strains. We took advantage of these features to explore the 
transposable element (TE) dynamics in interspecific crosses. We fully characterized the copies of this TE family in the D. mojavensis 
genome and identified at least one complete copy. We showed that this element is transcriptionally active in the ovaries and testes of 
both species and in their hybrids. IVloreover, we showed that this element is upregulated in hybrid males, which could be associated 
with the male-sterile phenotype. 

Key words: hybrids, non-LTR retrotransposon, repleta group, phylogeny. 



Introduction 

Virtually all species harbor transposable elements (TEs), se- 
quences that are able to move within genomes and that 
differ in structure, activity, and copy number between popula- 
tions and species. TEs are classified in two broad classes: Class I 
and Class II, based on their mode of transposition. Class I is 
composed of two types of elements that transpose through 
RNA intermediates: elements with long terminal repeats (LTRs) 
and elements without these direct repeats, the non-LTR ele- 
ments, which can further be divided into two types: the long 
interspersed nuclear elements (LINEs) and the short inter- 
spersed nuclear elements (SINEs). Class II (DNA transposons, 
rolling circle transposons, and miniature inverted repeat ele- 
ments) is composed of TEs that transpose directly through a 
DNA molecule (Wicker et al. 2007; Kapitonov and Jurka 2008). 

The LINES are among the oldest genetic elements in eu- 
karyotes and are the most abundant in the human genome 
(Fawcett et al. 1985; Simonelig et al. 1988; Bucheton et al. 



1992; Han 2010). They consist of a 5'-untranslated region 
(UTR), which has promoter activity, two open-reading 
frames (ORFs) separated by a spacer, and a 3'-UTR with a 
poly-A tail. Our study focused on one /-like element, a LINE 
element first described in Drosophila melanogaster at the be- 
ginning of the 1970s (Picard 1976). The / element (originally 
called the / factor) is responsible for the hybrid dysgenesis 
syndrome in D. melanogaster females. This species presents 
two types of strains based on the presence and activity of this 
element: The / strains, harboring complete and functional / 
elements, and the R strains, with nonfunctional / elements. 
Crosses between / males (Inducer) and R females (Reactive) 
result in high rates of transposition in the germline of female 
offspring, known as SF females. These females are sterile be- 
cause most of the laid eggs fail to hatch due to the very high 
frequency of / element transposition, which causes genetic 
abnormalities such as failure in meiotic divisions and chromo- 
somal rearrangements (Bucheton etal. 1976, 1984; Bucheton 
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1990; Picard et al. 1978; Chaboissier et al. 1990, 1995). The 
dysgenesis caused by the / element in D. melanogaster is re- 
stricted to the female germline and has not been reported in 
males. Furthermore, the transposition rate correlates with the 
level of sterility in the SF females (Chaboissier et al. 1990; 
Seleme et al. 1999, 2006). 

The number of / copies in D. melanogaster is approximately 
30, but only five functional sequences are known to be 
dispersed on the chromosomal arms. The complete and func- 
tional copies are 5.4 kb long, possess two long ORFs with 
cysteine-rich motifs, apurinic-apyrimidic endonucleases, 
reverse transcriptase (RT), and RNase H domains, and the 
3'-end has several TAA repeats. These copies are expressed 
in the nurse cells of dysgenic ovaries, and the transcripts are 
transported into the oocytes, where retrotransposition occurs 
(Pelisson and Picard 1979; Fawcett et al. 1986; Vaury et al. 
1990; Seleme et al. 1999, 2005). In Drosophila, piwi-interact- 
ing RNAs (piRNAs) are important posttranscriptional regulators 
that are responsible for TE mRNA degradation (Khurana and 
Theurkauf 2010; Senti and Brennecke 2010; Kelleher et al. 
2012; Akkoucheetal. 2013; Dufourtetal. 2013). Specifically, 
the piRNAs are implicated in maternal / element control in 
Drosophila (Brennecke et al. 2008; Chambeyron et al. 2008). 

The implication of TEs in hybrid dysgenesis is one of the 
examples in which transposition may drive population isola- 
tion and is often proposed as one of the first steps of specia- 
tion (Kidwell and Novy 1979; Fontdevila 2005). However, the 
TE dynamics of expression and transposition have not been 
well described during this process. Hence, several questions 
remain unanswered regarding this subject. If TEs act in the first 
steps of speciation, what happens when crossing very closely 
related species that still have incomplete reproductive isola- 
tion? In particular, are /-like elements associated with the in- 
compatibility observed, and are they related to high rates of 
hybrid transposition as reported in other species (Ungerer et al. 
2009; Cavallini et al. 2010; Moschetti et al. 2010; Kelleher 
et al. 2012; Vela et al. 2014)? In this report, we made a first 
investigation to check whether /-like elements could poten- 
tially be involved in hybrid sterility. 

Drosophila mojavensis and its sister species analyzed in this 
study — D. arizonae — are a perfect pair of species with which to 
address these questions. They are found in the desert of the 
southwest United States and Mexico and share sympatric areas 
in southern Arizona and the State of Sonora (Mexico). These 
two species present three interesting features that facilitate the 
investigation of the above questions: 1) Their hybrids can be 
produced in the laboratory; 2) the genome of D. mojavensis 
has been sequenced, which allow us to analyze the sequences 
of its TEs; and 3) these species exhibit variable degrees of pre- 
and postzygotic isolation (Wasserman and Koepfer 1 977; Reed 
and Markow 2004; Massie and Markow 2005). Moreover, the 
male offspring of D. anzonae females and D. mojavensis males 
is sterile, but in the reciprocal cross, the hybrids have motile 
sperm depending on the origin of the D. mo/avens;s population 



that is used in the crosses (Reed and Markow 2004). 
Additionally, the strong prezygotic isolation is higher between 
flies from sympatric areas than between the allopatric flies, but 
the isolation level depends in part on the direction of the 
crosses and the geographic origin of the populations 
(Wasserman and Koepfer 1977; Ruiz et al. 1990; Reed and 
Markow 2004; Massie and Markow 2005). 

We found four potential full-length copies of /-like elements 
and reconstructed their phylogenetic relationships with other/ 
family members in Drosophila species, thus establishing the 
vertical and ancestral inheritance of the / element of 
D. mojavensis from a common ancestor shared with the mel- 
anogaster group. We also showed that the D. mojavensis /-like 
sequences are transcriptionally and transpositionally regulated 
in the hybrid female germline. However, a significant increase 
in transcription was detected in testes from hybrids coming 
from D. arizonae mothers. This is the first report that shows 
the activity of /-like elements in the male germline, and it 
suggests a link with the male-sterile phenotype observed in 
the male hybrids of the studied species. 

Materials and Methods 

Genomic Analyses 

To investigate the occurrence of /-like elements in the 
D. mojavensis sequenced genome, we used the canonical / 
sequence from D. melanogaster {l_D!\A}, which is available in 
the Repbase library Rel. 16.06, v.4 (Fawcett et al. 1986). This 
sequence was blasted against the scaffolds of the D. mojaven- 
sis genome (version-70, 13/dmoj_caf1) on the EnsembI 
Metazoa platform using BLASTN D. mojavensis (http://meta- 
zoa.ensembl.org/index.html, last accessed July 4, 2014). 

The hits were manually curated, and only the largest and 
putative complete copies (more than 2,000 bp) were se- 
lected. We then selected the D. mojavensis copies that 
were over 80% identical between them. The sequences 
with the above characteristics were selected and analyzed 
using the ORF-FINDER web tool (http;//www.ncbi.nlm.nih. 
gov/projects/gorf/, last accessed July 4, 2014) to identify 
putative coding regions. The ORFs identified were com- 
pared and confirmed by BLAST search for protein se- 
quences at the NCBI website, and the conserved protein 
domains were checked on the conserved domain database 
(http://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml, 
last accessed July 4, 2014). The sequence with the most 
complete structure was selected and hereafter referred to 
as "Reference Copy" or "RC" (fig. 1). 

The RC was used to search for /-like element copies in the 
D. mojavensis genome with identity (ID) higher than 80% and 
length greater than 200 bp. This criterion was chosen based 
on the 80-80-80 rule for TE family determination, as proposed 
by Wicker et al. (2007) for the identification of TEs belonging 
to the same family. The position of each copy (start and end) 
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on the genome scaffolds, the length of each fragment aligned 
by BLAST, and the number of the unidentified nucleotides (A/) 
were recorded (table 1). We performed multiple alignments of 
the gag and RT domains (nt) identified in the copies, and we 
determined the ID of each domain, based on those of the RC, 
using Bioedit version 7.0.4.1 (Hall 1999). A drawing of the 
copies is shown in figure 2 and represents the integrity of 
the putatively full-length copies compared with RC . To confirm 
that we recovered unique copies in the genome, the 5'- and 3'- 
end flanking sequences of each copy were retrieved, and their 
identities were determined compared with the RC flanking 
sequences. All copies corresponded to different insertions. 

To check the functional organization of all putatively full- 
length copies of D. mojavensis, the 5' -end sequences were 
compared with the sequence of the / promoter of D. melano- 
gasfer (Minchiotti et al. 1997) using BLAST2seq. 

The chromosomal location of the most complete / copies 
was inferred from the scaffold coordinates of each copy, and 
the correspondence of scaffolds with D. mojavensis polytene 
chromosome maps was performed using information found 
in Flybase.org (http://flybase.org/maps/chromosomes/maps. 
html, last accessed July 4, 2014) and in Schaeffer et al. 
(2008). For an intrachromosomal distribution of the copies, 
each D. mojavensis chromosome (X, 2, 3, 4, 5, and 6) was 
divided into three regions: Distal (1 0% of the sequence), cen- 
tral (80% of the sequence), and proximal (or centromeric, 
10% of sequence) segments in relation to the position of 
the centromere, to test the distribution of the copies in 
these regions (Marzo et al. 2013). 

The nomenclature used to identify each copy followed 
Wicker et al. (2007); for example, RII_DMJA1 consists of Rll 
(for the superfamily), DMJ (for D. mojavensis), and A to Z (for 
the clades formed by phylogenetic analyses, where the excep- 
tion is X, which was used for the undetermined groups), fol- 
lowed by the number of each copy within each clade. 

Evolutionary Analyses of RT Sequences 

We reconstructed the evolutionary relationships between the 
D. mojavensis/D. arizonae /-like sequences with all the / super- 
family members of Drosophila species deposited in the 



Repbase library (http:/AAww.girinst.org/repbase/, last accessed 
July 4, 2014). Each retrieved element was analyzed using ORF 
FINDER (http:/AAAAAA/.ncbi. nlm.nih.gov/gorf/gorf.html, last 
accessed July 4, 2014), and only the amino acid sequences 
of the second ORF were used for further analysis. We identi- 
fied the RT domain in each protein sequence using the struc- 
tural domain bank from the Pfam platform (http://pfam. 
sanger.ac.uk/, last accessed July 4, 2014). The Drosopfiila 
Repbase RT domains were aligned with the D. mojavensis 
and D. arizonae RT domains using MEGA5 (Tamura et al. 
2011). For D. mojavensis (strains #15081-1352.01 and 
#15081-132.26 from US San Diego Drosophila Stock 
Center) and D. arizonae (strains #15081-1271.17 and 
#15081-1271.18 from US San Diego Drosophila Stock 
Center), we amplified and sequenced the 627 bp RT sequence 
(RTJf: 5'-GCCTAGTCATCCCTATC-3' and RTJr: 5'-TGTTAGC 
GCCGGTTGTATT-3'), corresponding to the position 2,813 and 
3,198 in the D. mojavensis RC sequence. Polymerase chain 
reaction (PCR) cycling parameters were: 94°C for 3min, 40 
cycles of 94°C for 45 s, 56 °C for 45 min, and 72 °C for 1 min, 
followed by a 72 °C for 7 min. The fragments obtained were 
purified directly from the PCR product, using the GFX PCR 
DNA and Gel Band Purification Kit (GE Healthcare), and 
cloned with the TOPO TA Cloning Kit (Invitrogen). Five ran- 
domly chosen clones were sequenced using the Ml 3 universal 
primers. 

We performed the amino acid phylogenetic reconstruction 
using the maximum-likelihood method, JTT model, and 500 
bootstrap replicates as implemented in PHYML 3.0 (Guindon 
et al. 2010) at the ATGC: Montpellier Bioinformatics platform 
(http://www.atgc-montpellier.fr/, last accessed July 4, 2014). 
Two sequences of other LINE superfamilies were used as out- 
groups: The D. melanogaster Fw non-LTR retrotransposon 
(GenBank accession number: Ml 72 14.1) and the D. yakuba 
Helena non-LTR retrotransposon (GenBank accession number: 
AF01 2049.1). 

RNA Extraction and RT-quantitative PCR 

The fly samples were obtained from the US San Diego 
Drosophila Stock Center. We used strains of D. mojavensis 




TAAA<4i) -'3' 



152t)p 12B1bp B8bp 3BS3bp MTbfJ 



Fig. 1. — ^The diagram represents the structure of the putative "RC" (5,382 bp) of the /-like non-LTR retrotransposon identified in the Drosophila 
mojavensis genome. The boxes represent the two ORFs, 0RF1 and 0RF2. EN, endonuclease domain; RT, reverse transcriptase domain; RH, RNase 
H domain; and Crosshatch box, promoter region. 
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Table 1 

Summary of tlie / Element Copies in the Drosophila mojavensis Genome 



Copy 


Acc. 


Scaff. 


Start S. 


End S. 


Start Q. 


End Q. 


Or. 


Chr.arms 


Region 


N 


% (ID) 


% (ID Flanq.) 


Leng. 


Integ. 


RII_DMJA1(RCf 


CH933806.1 


6540 


12533998 


12539232 








2(E) 


Central 


0 






5,382 


Comp. 


RII_DMJA2= 


CH933808.1 


6496 


21912581 


21918262 


1785 


1052 




5(C) 


Central 


1,027 


0.999 


0.433 


4,505 


Comp. 


RII_DMJA3 


CH933807.1 


6500 


11558188 


1 1 563761 


1 


5382 


+ 


3(B) 


Central 


321 


0.9981 


0.43 


4,948 


Comp. 


RILDMJXI 


CH933815.1 


6482 


430575 


438784 


1 


5382 


+ 


N/A 


N/A 


5,092 


0.996 


0.447 


2,808 


Comp. 


RII_DMJX2 


CH935525.1 


12 


175 


1389 


1 


4456 




N/A 


N/A 


0 


0.99 


B/# 


1,390 


Def. 


RII_DMJX3 


CH937682.1 


97 


808 


1490 


4702 


5382 




N/A 


N/A 


0 


0.99 


B/# 


681 


Def. 


RII_DMJX4 


CH933809.1 


6680 


15471148 


15472992 


3547 


5382 


+ 


4(D) 


Central 


0 


0.997 


0.39 


1,848 


Def. 


RII_DMJX5 


CH933813.1 


6498 


3343118 


3344746 


3755 


5382 


+ 


N/A 


N/A 


4049 


0.996 


0.391 


1,629 


Def. 


RII_DMJX6 


CH933808.1 


6496 


9570337 


9570944 


4774 


5382 


+ 


5(C) 


Central 


0 


0.998 


0.417 


609 


Def. 


RII_DMJX7 


CH933814.1 


6308 


2487507 


2487762 


5128 


5382 




X(A) 


Telomeric 


0 


0.984 


0.385 


256 


Def. 



NoTt. — Acc, accession number in GenBank; Scaff., scaffold; Start S., scaffold start hit; End S., scaffold end hit; Start Q., query start hit; End Q., query end hit; Or., 
orientation of the copy in genonne; Chr.arms, chromosome arms and conventional numbering with Muller syntenic elements in parentheses; Region, chromosome region; N, 
number of unidentified nucleotides; % ID, nucleic percent ID to RC; % ID Flanq., nucleic percent ID the 5'-flanking region to RC flanking region; Leng., total alignment 
length; Integ., copy integrity: Comp., putatively complete; Def., defective copies; N/A, not annotated; #, not computed. 

^Sequence used in phylogeny (fig. 4). 
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Fig. 2. — Structure of the putatively complete copies of the /-like non-LTR retrotransposon identified in the Drosophila mojavensis genome. DMJA1 to XI , 
copies shown in table 1; shaded gray boxes, unidentified nucleotides (N); red boxes, 5' -end promoter position; green boxes, 0RF1; blue boxes, 0RF2; scale 
size range of the copies is presented in base pairs. 



(Anza Borrego Desert, CA; stock number: 15081-1352.01) 
and D. arizonae (Metztitlan, Hidalgo, Mexico; stock number: 
15081-1271.17) for crosses. These two allopatric strains pre- 
sent incomplete prezygotic isolation, and we could obtain 
enough F1 hybrids for both reciprocal crosses. In crosses be- 
tween D. arizonae females and D. mojavensis males, F1 males 
are sterile. These populations were maintained at 25 °C in 
mass cultures. One- to two-day-old virgins (females and 
males) were isolated from the parental lines and split in 
vials. Subsequently, the crosses were performed 
(D. mq/ai/ens/s females x D. arizonae males and the reciprocal 
cross) using ten couples per cross. F1 hybrids were isolated, 
and 5-day-old females and males were used for DNA and RNA 
extraction. 

The flies were dissected, and their tissues were placed in 
1 X phosphate buffer saline. The RNA was extracted from 
pools of 20 pairs of ovaries and from 30 testes from parental 
and hybrids. Two independent sets of crosses were per- 
formed. For each, three replicates were performed to extract 



total RNA using the RNeasy kit (Qiagen) and treated with 
DNase (the DNA-free kit; Ambion). One microgram of the 
total RNA was then converted into cDNA using the 
Thermoscript Invitrogen kit primed with a mix of oligo(dT)20 
and random primers. 

The expression level of the RT of the /-like elements was 
measured by RT-quantitative (qPCR) using the following pri- 
mers: Forward 5'-ATC CAC TCT TCA ACG GCA TC-3' and 
reverse 5'-TGG ACG ATA TGG TGC AAA TG-3'. We verified 
by sequencing (the TOPO TA cloning kit; Invitrogen) that no 
sequence polymorphism was present in the primers that could 
affect the PGR efficiency. Five clones of each strain were se- 
quenced (Applied Biosystems 3730XL), and the sequences 
obtained were aligned using BLAST alignment at the NCBI 
website. The cDNA was diluted 50 times, and the relative 
mRNA level was quantified using SYBR green qPCR in a 
LightCycler 480 apparatus (Roche Diagnostics). The transcript 
quantity was estimated relative to the rp49 reference gene of 
D. mojavensis using the primers described by Granzotto et al. 
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(2009). The RT-qPCR experiments were performed with tech- 
nical triplicates. Only RT-qPCR experiments with efficiencies 
greater than 1 .9 were retained. 

Fluorescent In Situ Hybridization 

/transcripts were detected by in situ hybridization in 5-day-old 
ovaries of parents and hybrids according to Akkouche et al. 
(2012). Ovaries were hybridized with a 952-bp riboprobe that 
corresponded to the RT and RNA-H domains in the RC and 
included a T3 promoter site in the reverse primer (forward: 
5'-GCA ATA CAA CCG GCG CTA ACA-3', reverse: 5'-TGG 
CTG TGG ATT TGG CTG TGA ATT AAC CCT CAC TAA AGG 
GA-3'). For the in vitro transcription, a digoxigenin (DIG)-RNA 
labeling mix (Roche) was used. After hybridization and strin- 
gency washes, an anti-DIG POD antibody mix (ROCHE) was 
used for chemiluminescent detection, with a fluorescence am- 
plification (the Tyramide Signal Amplification Kit; 
PerkinElmer). The DNA was labeled by SYTOX Green Nucleic 
Acid Stain (Life Technology), and the visualization was per- 
formed with a Zeiss LSM51G Meta confocal microscope. 
Tissues without anti-DIG treatment were used as a negative 
control, since no label is expected without this secondary an- 
tibody. Five-day-old testes hybridization was performed with 
the same / element probe according to Morris et al. (2009). To 
be able to localize the / transcripts in the testis, we used Cyclin 
B as a positive control. The Cyclin B gene is expressed at very 
high levels in primary spermatocytes, whereas the mitotic cells 
in the germinal proliferation center present lower levels of 
expression (Morris et al. 2009). We used a Cyclin B riboprobe 
of 730-bp in which primers were designed for the D. moja- 
vensis gene and include a T3 promoter site (forward: 5'-CGA 
TGT CCT TGT CCA CCA AA-3', reverse: 5'-GCA GCC GTA 
TAA CGG GAA TAG ATT AAC CCT CAC TAA AGG GA-5') 
(supplementary fig. SI, Supplementary Material online). 
The testes images were processed in a Zeiss Axioskop 2 mot 
plus microscope and analyzed with Axiovision Release 4.8 
software. 

TE Activity (Transposon Display) 

Transposon display (TD) was performed with protocols 
adapted from several reports (Munroe et al. 1994; Esnault 
et al. 2008; Akkouche et al. 2012). Total genomic DNA was 
isolated from single individuals used in crosses between 
D. mojavensis females and D. arizonae males, and the F1 
single hybrids were backcrossed with D. arizonae males to 
obtain the F2 generation (supplementary fig. S2, 
Supplementary Material online). We could only perform 
crosses in this direction because individual crosses between 
single D. arizonae females with single D. mojavensis males 
did not give any offspring. This is because flies will not mate 
under the low density condition of single pair crosses. From 
the six individual crosses performed with the parental lines, we 
were able to follow two independent families. Individuals 



analyzed by TD are displayed in supplementary table SI, 
Supplementary Material online. The TD technique also al- 
lowed us to have an estimation of the copy number in each 
strain. 

The genomic DNA (100ng/|.il) was digested with Hindlll 
(10U/|.il, 3h at 37 °C). The MSBE adapter was linked with 
the Hindlll enzyme adaptor site in one double-stranded reac- 
tion (Hindlll-MSEB) using 100|iM of Hindlink, lOO^M of 
MSEB, SSC 20x, and 1 M Ths in total volume of 333 ^1. The 
product was denatured for 5 min in 92 °C and cooled at room 
temperature. The linker Hindlll-MSEB was ligated to digested 
DNA with a T4 DNA ligase (10 units), enzyme buffer (5 x), and 
digested DNA in a 50-|.il reaction for 3 h at 23 °C. DNA was 
amplified with a primer adapter (LNP 5'-GAA TTC GTC AAC 
ATA GCA TTT CT-3') and a forward primer inside of the 3'-end 
of the / element (5'-TAA CTG TCC TGC AAC TTC CCA CCT- 
3'). The amplification conditions were as follows: denaturation 
at 94°C, 2min; followed by 94°C, 30s; 59°C, 1 min; 72°C, 
1 min; 35 cycles; then, 72 °C, 10 min. Subsequently, the prod- 
uct amplification was run on a gel, and analyses were per- 
formed with QIAxcel ScreenGel 1 .1 .0 software. 

The TD method was tested with D. mojavensis genomic 
DNA, extracted from a single flies, that was digested by 
Hindlll and amplified by PGR with Hindlll-MSEB and / primers. 
Then, two fragments (842 and 450 bp), identified in silico and 
observed in electrophoresis, were cloned (with five clones for 
each fragment), amplified by PGR with Ml 3 universal primers, 
and sequenced. The alignment of these sequences against the 
D. mojavensis RC confirmed the veracity of the fragments 
obtained by enzyme digestion and amplification for the / 
element. 

Results 

Our study revealed the presence of an /-like element in 
D. mojavensis and D. arizonae that is closely related to the / 
element found in the genomes of the D. melanogaster sub- 
group. In silico and in vivo analyses showed the occurrence of 
complete and transcriptionally active copies in both species 
and in their hybrids. 

The / Element of D. mojavensis 

We detected 10 /-like copies in the D. mojavensis genome 
sequence (table 1). One potentially complete copy, called RC 
(RILDMJAI), is 5,382-bp long and contains two ORFs (ORE 1 
and ORE 2) with no internal stop codons and with all the 
expected protein domains for this type of non-LTR retrotran- 
sposon (Malik et al. 1999; fig. 1). It has 62.42% nucleic ID 
with the D. melanogaster I element and is only 7 bp longer. 
We identified three other copies that are most likely complete: 
RII_DMJA2, RII_DMJA3, and RII_DMJX1 (table 1 and fig. 2), 
which show high ID and conserved protein domains com- 
pared with the RC, but their sequences were interrupted 
with unidentified nucleotides (N). Similarly, the RII_DMJX5 
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Sites 





1 2 


3 


4 


5 


6 


7 8 
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copy, with 1 ,629 bp aligned against RC, may also be complete 
because it has a long N stretch at its 5'-end. The other copies 
were small and fragmented by insertions and deletions. 
The identification of the relative chromosome position of 
these copies, as established by analyzing the polytene chro- 
mosome maps found in Flybase (Schaeffer et al. 2008), re- 
vealed that three of the four putative full-length copies, 
RILDMJAI, RII_DMJA2, and RII_DMJA3, were inserted in 
the central portion of different chromosome arms correspond- 
ing to chromosomes 2, 3, and 5, respectively. 



Moreover, we looked for the / element promoter in the 
D. mojavensis copies by aligning the flanking sequences of 
each copy with the promoter sequence of the / element of 
D. me/anogasfer (Minchiotti et al. 1997; Han 2010). We iden- 
tified a putative promoter in the RC that has the same length 
(41 bp) and is well conserved (76% ID) compared with that of 
the D. melanogaster I element. This promoter region is identical 
in the four putative full-length copies found in the D. mojavensis 
genome (fig. 3). The four longest / sequences have a poly-A tail 
in the 3'-UTR, with an average of four TAAA repeats. 
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Phylogenetic Relationship between the D. mojavensis 
/-Like Element and the / Elements of Other 
Drosophila Species 

The phylogenetic relationship of the different /-lil<e elements 
was reconstructed using the RT protein sequences 
(see Materials and Methods for details) (fig. 4). It allowed us 
to distinguish four major clades of sequences (A-D). Different 
/-like sequences are present in species of the melanogaster 
group of the subgenus Sophophora (see supplementary fig. 
S3, Supplementary Material online, for the species phylogeny), 
which are included in all four clades (fig. 4). Clade A can be 
divided into two groups of sequences (Clade Al, bootstrap: 
70% and Clade A2, bootstrap: 99%). Inside Clade Al , which 
branches in two smaller clades (Ala and Alb), all the /-like 
sequences of the D. mojavensis and of D. arizonae cluster in a 
highly supported clade (bootstrap: 1 00%), which clusters with 
the D. Wn7/s sequence in a well-supported monophyletic group 
(Al b: 99%). This clade is closely related to the group of se- 
quences from the melanogaster group (Clade Ala), which 
includes the canonical / element from D. melanogaster 
(l_DM}. These two groups of sequences (Ala and Alb) form 
a sister group of Clade A2. This clade includes sequences 
found in the same species that form Clade Ala with the ex- 
ception of D. biarmipes and of D. persimllis, a species of the 
obscura group. The other three clades (B, C, and D) include 
more divergent /-like sequences from species belonging only 
to the Sophophora subgenus: Clade B includes only species 
from the melanogaster group, whereas Clades C and D also 
include sequences from D. willistoni (the willistoni species 
group). The phylogeny reveals that different /-like families 
are harbored by the same species, as is the case for D. kikka- 
wai, whose sequences branch into three different clades (Al , 
A2, and C). 

Activity of the Complete / Copies 

We observed / element mRNA in ovarian cells by immunoflu- 
orescence and detected the presence of transcripts in both 
parents and hybrids. The / transcripts had the same localization 
in both species, with an accumulation in the nurse cells and no 
labeling of the oocytes. We did not detect any signal in the 
follicle cells (fig. 5). In testes, we observed labeling in parental 
lines and both types of hybrids (fig. 6). Transcripts were spe- 
cifically located in primary and meiotic spermatocyte cells 
(Fuller 1998). Since in situ hybridization is not a quantitative 
technique, to investigate whether the levels of expression 
were equal between the different lines (parents and hybrids), 
we performed expression analyses by RT-qPCR. The results 
showed transcription of the / elements in D. mojavensis and 
in D. arizonae, both in ovaries and testes (fig. 7). The levels of 
transcription between the parental lines and the reciprocal 
male and female hybrids were not significantly different be- 
tween ovaries from parents and hybrids (two-way analysis of 
variance [ANOVA]). However, expression levels were 



significantly higher in hybrid testes produced by crosses be- 
tween D. arizonae females and D. mojavensis males than in 
the reciprocal cross (f(3, 9) = 84.78, P< 0.0001). We obtained 
the same results with crosses and expression analyses per- 
formed again 6 months later (supplementary fig. S4, 
Supplementary Material online). 

To detect transposition, we performed TD analyses. This 
experiment needs to be performed with individual crosses, 
in order to follow new transpositions that arrived in the germ- 
line cells, and that will be present in all the offspring of the two 
parents. Unfortunately, these experiments could not be per- 
formed for both reciprocal crosses because individual crosses 
between single D. arizonae females with single D. mojavensis 
males did not give any offspring. This cross is only possible 
with a mass of individuals. TD was thus only performed in one 
cross direction (supplementary table SI and fig. SI, 
Supplementary Material online). TD also allows determining 
the copy number. In the individuals that we analyzed, 
D. mojavensis females had one copy of /-like element and 
D. arizonae males had from three to five copies of the ele- 
ment. In agreement with the RT-qPCR results, no transposi- 
tion was detected in the F1 tissues, which indicates that the / 
element does not transpose in the parental gonads. 
Additionally, no transposition was detected in the F2 genera- 
tion, indicating that neither the / sequences in F1 females nor 
in D. arizonae males induce transposition (data not shown). 
However, given the limitations of TD technique, we cannot 
exclude that somatic transposition occurred, but we were 
unable to measure this with confidence. 

Discussion 

Identification and Phylogenetic Analysis of /-Like Elements 

In this report, we identified and characterized /-like sequences 
in the sequenced genome of D. mojavensis. In addition to 
several defective and divergent sequences, we identified 
four putatively complete copies in this species, but only one 
showed all typical structural domains of the / superfamily 
(Malik et al. 1999). This finding suggests that this copy may 
also have transposition ability. In D. melanogaster, two intact 
ORFs are a requisite for protein encoding and complete retro- 
transposition of the / factor (Seleme et al. 2005). The presence 
of unidentified nucleotides in some of the detected copies in 
D. mojavensis suggests the possibility that other sequences are 
potentially full-length copies. One argument favoring this hy- 
pothesis is the conservation of the promoter region that we 
identified, which is typically not conserved between species 
(Minchiotti et al. 1997). This promoter region is more con- 
served between the copies from D. mojavensis and D. mela- 
nogaster (76%), than the RT sequences (66%), which is 
puzzling. Further work should be done to understand if this 
could be related to the testis-specific expression in the hybrid 
male offspring. 
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Fig. 5. — Immunofluorescence for the / element mRNA probe (RNA in red and DNA in green) in the egg chamber of the parental and hybrid crosses. Oo, 
oocyte; NC, nurse cells; Fc, follicle cells. The arrows indicate mRNA of the detected / transcripts. 



Non-LTR retrotransposons are common elements in 
Drosophila genomes and are often described as vertically in- 
herited in this group of species (Malik et al. 1999; Granzotto 
et al. 2011). /-like element sequences have been found in 
many species, and their distribution correlates with the phylo- 
genetic relationships between species (Bucheton et al. 1992), 
which indicates that they are old components of Drosophila 
genomes. However, / elements with a structure strikingly sim- 
ilar to that of D. melanogaster {l_DM) occur only in the species 



of the melanogaster subgroup (D. simulans, D. mauritiana, 
and D. sechellia). The most widely supported hypothesis is 
that ancestral / elements were lost from the genome of 
D. me/anogasfer but were then reintroduced recently because 
older strains of D. melanogaster contain only defective and 
immobile / sequences, and all the strains collected after 1930 
have both types of sequences (defective and active mobile / 
elements). This observation is in agreement with the hypoth- 
esis that the complete and active / factor progressively invaded 
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Fig. 6. — Immunofluorescence for the / element mRNA probe (RNA in red and DNA in green) in testes of the parental and hybrid lines. Yellow arrows 
indicate detected mRNA transcripts. Sc.Mt.c, sperm and mitotic cells; first spty, primary spermatocytes; Me.sp., meiotic spermatocytes; R.sp, round 
spermatids; Di.sp, differentiating spermatids; El.sp, elongating spermatids; horizontal arrow indicates the direction of differentiation (stem cells are located 
at the apical tip and germ cells move toward the basal end as they differentiate). 



D. melanogaster aiter this period (Kidwell 1983). The process 
by which / elements have reinvaded the D. melanogaster 
genome is unknown. Horizontal transfers from another spe- 
cies (Simonelig et al. 1 988) is one of the possibilities, but more 
complex hypotheses, such as the reactivation of sequences 
sequestered in the heterochromatin, should also be 
considered. 

The phylogeny placement of the /-like elements from D. 
mojavensis is generally in agreement with the species phy- 
logeny. All the /-like sequences of D. mojavensis and D. ah- 
zonae branch together with the sequence of D. virilis, as 
expected based on the species phylogeny because these spe- 
cies belong to the virilis-repleta radiation of the subgenus 
Drosophila suggesting a vertical inheritance of these se- 
quences in this Drosophila radiation. However, when we an- 
alyze the global phylogeny of the /-like element, we cannot 
exclude that some horizontal transfer events may have oc- 
curred between other Drosoptiila species. For example, this 
could have been the case between D. ficuspliila and D. bipec- 
tinata (see Clade Ala in fig. 4). 

The /-like element superfamily is a very old component of 
the genome of Drosopliila, and its diversification into several /- 
like families most likely occurred very early in the evolution of 
Drosopliila, most likely before the separation of the 
Sophophora and Drosophila subgenera. More diversification 



occurred in the Sophophora subgenus, which was accompa- 
nied by sequence loss and horizontal transfer events. In the 
Drosophila subgenus, we can only assume that random loss of 
the / families lead to only one remaining family, which is de- 
scribed in this article and which is closely related to the / factor 
from D. melanogaster. 

Activity of the / Element 

We measured the transcriptional activity of the / elements by 
RT-qPCR and fluorescent in situ hybridization (FISH) experi- 
ments in the parental lines and their hybrids. The level of / 
element mRNA was low in ovaries and testes of the parental 
lines. In the parental ovaries, the / element transcripts accu- 
mulate in the nurse cells. This type of labeling is also observed 
for the / element in Drosophila ovaries (Seleme et al. 2005; 
Chambeyron et al. 2008). However, transcription of the / el- 
ement was never reported in testes in this species. Our RT- 
qPCR analysis shows that the / element is transcribed in testes 
of both D. mojavensis and D. arizonae; this was confirmed by 
FISH, which showed labeling in primary and meiotic sperma- 
tocytes. In the reciprocal hybrids between the two species, the 
levels of expression were identical for the female germline 
tissues, with no difference between parents and F1 hybrids. 
FISH experiments also confirmed this result because no 
change in the localization of the transcripts and no labeling 
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Fig. 7. — Expression of /-like elements by RT-qPCR in ovaries (A) and testes (6) of Drosophila mojavensis, D. arizonae, and their hybrids (F for female and 
M for male). The transcript levels of the / element were estimated relative to those of the rp49 gene. No significant differences were observed in ovaries (A). 
On the contrary, in testis, mRNA levels were significantly different (two-way ANOVA). 



in the oocytes were observed. For the male germline, the re- 
sults were quite different depending on the direction of the 
cross. F1 males from D. arizonae male parents showed no 
difference in the expression of the / element in testes. 
However, when D. mojavensis males were used as parents, 
a significant increase in the / element expression in testes was 
observed. The fact that F1 males in this particular cross are 
sterile (Ruiz et al. 1990) suggests a role for TEs in the male- 
sterile phenotype. Furthermore, it is important to determine 
whether the high expression in F1 hybrids observed in RT- 
qPCR is derived from complete copies of D. mojavensis 
males. Hybrid dysgenesis in D. melanogaster involves transpo- 
son-naive females and active transposon-containing males in 
several different systems: The l-R system, the P-IVI system, and 
even the H-f system (Bucheton et al. 1 976; Picard et al. 1 978; 
Kidwell and Novy 1979; Kidwell 1983; Streck et al. 1986). 
However, at least in the l-R and P-M systems, it is the mater- 
nal line that is affected. 

The presence of /transcripts only in the nurse cells in ovaries 
of D. mojavensis, D. arizonae, and their hybrids suggests that 
the / element in both species is most likely regulated by the 
piRNA pathway. In Drosophila ovaries, Chambeyron et al. 
(2008) showed that there is an association between the ac- 
cumulation of piRNA and the regulation of the / element tran- 
scripts in the nurse cells, where the transcripts are processed, 
before being transported to the oocytes, where the retrotran- 
sposition occurs. Such findings suggest a posttranscriptional 
regulation of the / element described here before its 



transportation to the oocytes, as reported in D. melanogaster 
(Brennecke et al. 2008; Chambeyron et al. 2008). 

In testes, the / transcripts in parents and hybrids are located 
in primary spermatocytes, where high levels of transcription 
for genes encoding proteins for spermatogenesis and genes 
encoding male germline-specific isoforms are detected (Fuller 
1998). The transcripts of many genes are stable and are pre- 
sent for days after meiosis but then decrease in the postmei- 
otic spermatids stage, during which proteins can be detected 
(Kuhn et al. 1988; Fuller 1998). The localization of the /-like 
transcripts observed in this study (fig. 7) corresponds to the 
transcriptional dynamics described for their genes. Other ele- 
ments have transcripts detected in primary spermatocytes. 
Transcripts of 412 retrotransposon were not detectable in 
ovaries by in situ hybridization in D. melanogaster and 
D. simulans populations but were detected in the male germ- 
line in primary spermatocytes (Borie et al. 2002). Furthermore, 
no association has been found between expression in soma 
and in testes, which indicates independent regulation (Borie 
et al. 2002). The same localization patterns were observed for 
the retrotransposons 7737, GATE, mdgl , and copia (Haoudi 
et al. 1997; Kogan et al. 2003; Morozova et al. 2004; 
Kalmykova et al. 2005). The relatively high number of tran- 
scribed elements, including the D. mojavensis I element, in 
primary spermatocytes reveals the importance of these cells 
in the transcription of these TEs. 

In this work, we have investigated the hypothesis that /-like 
elements (which are implicated in the hybrid dysgenic 
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incompatibility of D. melanogaster) could be activated in hy- 
brids between D. mojavensis and D. ahzonae, two recently 
diverged species from the Drosophila subgenus of the genus 
Drosophila. We showed that / elements are specifically acti- 
vated in testes from a specific direction of the intercross, and 
we suggest that this activation may be associated with the 
observed male sterility. Moreover, we showed that this acti- 
vation is sex specific because the female germline does not 
seem to be affected by this phenomenon. The impact of TEs in 
the male sterility observed in interspecific crosses is becoming 
clearer and needs further investigations. Studies using closely 
related species will be helpful to understand the first steps of 
the male sterility process and the role of TEs. 

Supplementary Material 

Supplementary figures S1-S4 and table SI are available at 
Genome Biology and Evolution online (http://wvvw.gbe. 
oxfordjournals.org/). 
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